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ABSTRACT 

We present a Fourier analysis of the clustering of galaxies in the combined Main galaxy and Luminous 
Red Galaxy (LRG) Sloan Digital Sky Survey (SDSS) Data Release 5 (DR5) sample. The aim of our 
analysis is to consider how well we can measure the cosmological matter density using the signature 
of the horizon at matter-radiation equality embedded in the large-scale power spectrum. The new 
data constrains the power spectrum on scales 100-600 ft." ^ Mpc with significantly higher precision 
than previous analyses of just the SDSS Main galaxies, due to our larger sample and the inclusion 
of the LRGs. This improvement means that we can now reveal a discrepancy between the shape of 
the measured power and linear CDM models on scales 0.01 < k < G.15/iMpc~^, with linear model 
fits favouring a lower matter density {Qm — 0.22 ± 0.04) on scales 0.01 < k < 0.06 /iMpc"^ and a 
higher matter density (JIm = 0.32 ± 0.01) when smaller scales are included, assuming a flat ACDM 
model with h = 0.73 and Us — 0.96. The lower matter density favoured by fitting our SDSS data for 
0.01 < k < 0.06/iMpc~^ is a better match to the best-fit WMAP 3-year cosmological model, and to 
results from the positions of the baryon oscillations observed in the SDSS DR5 power spectrum. This 
discrepancy could be explained by scale-dependent bias and, by analysing subsamples of galaxies, we 
find that the ratio of small-scale to large-scale power increases with galaxy luminosity, so all of the 
SDSS galaxies cannot trace the same power spectrum shape over 0.01 < k < 0.2/iMpc~^. However, 
the data are insufficient to clearly show a luminosity-dependent change in the largest scale at which a 
significant increase in clustering is observed, although they do not rule out such an effect. Significant 
scale-dependent galaxy bias on large-scales, which changes with the r-band luminosity of the galaxies, 
could potentially explain differences in our Qm estimates and differences previously observed between 
2dFGRS and SDSS power spectra and the resulting parameter constraints. 
Subject headings: cosmology: cosmological parameters, large-scale structure of universe 
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1. INTRODUCTION 

The evolution of perturbations in the early universe 
imprints characteristic scales that depend on the aver- 
age matter density (Silk 1968; Peebles & Yu 1970; Sun- 
yaev & Zel'dovich 1970; Bond & Efstathiou 1984, 1987; 
Holtzman 1989). Fundamentally, the growth of fluctu- 
ations is intimately linked to the Jeans scale; perturba- 
tions smaller than the Jeans scale do not collapse due to 
pressure support, while larger perturbations are free to 
grow through gravity. In the radiation dominated era, 
the dark matter has negligible density compared to the 
photon-baryon fluid, and the perturbations in this fluid 
are stabilised by the high radiation pressure at a time 
when the sound speed was of order c/^/3. Consequently, 
the Jeans scale is of order the horizon scale until matter- 
radiation equality, after which it reduces to zero when 
the matter dominates. We therefore see that the hori- 
zon scale at matter-radiation equality will be imprinted 
in the distribution of fluctuations - this scale marks a 
turn-over in the growth rate of fluctuations. 

In a model with only coUisionless dark matter, all 
lengths scale with the horizon scale at matter radia- 
tion equality, which is a multiple of (r^M^^)"^, where 
h = iJo/lOOkms^^Mpc^^ Consequently, CDM trans- 
it SUPA; Institute for Astronomy, University of Edinburgh, 
Royal Observatory, Blackford Hill, Edinburgh, EH9 3HJ, UK 
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fer function fitting formulae (such as Eq. G3 of Bardccn 
et al. 1986) were traditionally created as a function of 
q = k/ (Qm/i^Mpc). When analysing galaxy redshift sur- 
veys, the comoving distance-redshift relation introduces 
another factor of h into the measurements, so the data 
constrain the transfer function in fc//iMpc^^. Conse- 
quently, fits between model and data constrain the de- 
generate parameter combination ^uh. 

Such cosmological constraints are important for break- 
ing degeneracies between cosmological parameters that 
exist when fitting just CMB data (Eisenstein et al. 1999: 
Efstathioii & Bond 1999; Percival et al. 2002). With 
increasingly precise temperature and polarisation CMB 
constraints (Hinshaw et al. 2006; Page et al. 2006; Spergel 
et al. 2006), the additional cosmological role of galaxy 
surveys to provide cross-checks is becoming increasingly 
important. In this paper, we carefully test the methodol- 
ogy behind using the shape of the galaxy power spectrum 
to provide cosmological constraints. 

Extracting the cosmological information encoded in 
the galaxy power spectrum has motivated many previ- 
ous surveys. Early studies, with of order 10** galaxies, 
include the CfA (Vogeley et al. 1992; Park et al. 1994), 
LCRS (Shectman et al. 1996) and PSCz (Saunders et 
al. 2000) surveys. These surveys were able to show that 
the shape of the power spectrum required a relatively 
small matter density, at odds with the simple Einstein- 
de Sitter model (e.g. Efstathiou et al. 1990; Tadros et al. 
1999). The same general shape of the power spectrum 
has been recovered by deprojecting the APM (Maddox et 
al. 1990, 1996) survey, the parent catalogue of the 2dF- 
GRS (Efstathiou & Moody 2001; PadiUa & Baugh 2003), 
and by deprojecting the photometric component of the 
SDSS catalogue (Scranton et al. 2002; Connolly et al. 
2002; Tegmark et al. 2002; Dodelson et al. 2002; Szalay 
et al. 2003). More recently, photometric redshifts have 
been exploited to improve the deprojection of galaxy dis- 
tances and the accuracy with which the power spectrum 
can be measured from photometric data. Two studies 
have recently been completed considering Luminous Red 
Galaxies (LRGs) within the SDSS survey (Padmanab- 
han et al. 2006; Blake et al. 2006), showing a marked 
improvement in accuracy on previous work. 

However, the most precise measurement of the 3- 
dimensional galaxy power spectrum is still recovered 
from redshift surveys. Over the past 5 years there has 
been a revolution in the number of galaxy redshifts mea- 
sured, and the size of the volmne probed by these sur- 
veys. Leading the way have been two large galaxy red- 
shift surveys, the 2dF Galaxy Redshift Survey (2dFGRS; 
Colless et al. 2001, 2003) and Sloan Digital Sky Survey 
(SDSS; York et al. 2000). In this paper we analyse the 
relative clustering strengths of galaxy samples as a func- 
tion of scale. The overall clustering amplitude is also 
potentially interesting in constraining cosmological mod- 
els, but takes more work to decode. This normalisation 
is known to be a strong function of both galaxy colour 
and luminosity (Park et al. 1994; Norberg et al. 2001, 
2002; Zehavi et al. 2002), so an midcrstanding of galaxy 
bias is required before cosmological constraints can be 
derived from this statistic. However, the effect of galaxy 
bias on the shape of the power spectrum is less certain, 
and is the subject of this paper. On small scales, where 



non-linear corrections to the matter power spectrum are 
important, the shapes of galaxy power spectra are known 
to depend on galaxy colour and luminosity (e.g. Cole et 
al. 2005). However, on large-scales, the effects on the 
shape are more uncertain. 

On larger scales where the matter in the universe is 
still expected to be predominantly in the linear regime 
k < 0.15ftMpc~^ (Smith et al. 2003), discrepancies 
currently exist between the shapes of the power spec- 
tra recovered from the 2dFGRS and SDSS. The prob- 
lem is demonstrated by the measurements of Om ob- 
tained from such fits. Assmning a Hubble parameter 
h = 0.72, the Tegmark et al. (2004) SDSS main galaxy 
analysis favoured Om = 0.296 ± 0.032. Similar val- 
ues of r^M — 0.3 were found by alternative analyses 
of the red selected SDSS main galaxy power spectrum 
(Pope et al. 2004), a set of SDSS spectroscopic Lumi- 
nous Red Galaxies (LRGs) (Eisenstein et al. 2005), and 
SDSS photometrically selected LRGs (Padmanabhan et 
al. 2006; Blake et al. 2006). In combination with the 
WMAP 1-year data (Spergel et al. 2003), the Eisenstein 
et al. (2005) analysis of the LRGs provided the con- 
straint = 0.273 ± 0.025 from a combination of the 
overall shape of the correlation function and the peak 
caused by baryon acoustic oscillations. In contrast, a 
lower matter density VIm = 0.231 ± 0.021 is favoured 
by measurements of clustering of blue selected galaxies 
in the 2dFGRS (Cole et al. 2005), who included a sim- 
ple model for scale-dependent bias, although this had a 
relatively minor effect on the recovered matter density. 
Recent results from the 3-year WMAP data have pro- 
vided an independent constraint on the matter density, 
finding Om = 0.234± 0.035 (mean constraint from Table 
2 of Spergel et al. 2006) from a better resolution of the 
third peak height. It is clear that these discrepancies be- 
tween measurements are not linked to a single technique 
or particular analysis. 

When the constraints from the galaxy power spectra 
are combined with the CMB data the discrepancy is still 
clear. From Table 5 of Spergel et al. (2006), we see that 
combining the 3-year WMAP data with additional cos- 
mological constraints from the 2dFGRS power spectrum 
of Cole et al. (2005) gives f^M = 0.236;o;^29- How- 
ever, when the WMAP data is combined with constraints 
from the SDSS power spectrum of Tegmark et al. (2004), 
the higher Qm/i constraint from the large-scale structure 
data increases the best fit to Om = 0.266^0 q40 (^^^ 
ble 6 of Spergel et al. 2006). 

The tension between measurements from different 
large-scale structure experiments and the CMB observa- 
tions is at the level of approximately 2a, and it is there- 
fore possible that it could be explained by cosmic vari- 
ance. Alternative explanations include a scale-dependent 
galaxy bias on scales k < 0.15/iMpc~^, or a systematic 
problem with one of the data sets. In this paper we 
test these possibilities by measuring the redshift- space 
power spectrum from the latest SDSS sample. Data Re- 
lease 5 (DR5), which contains approximately twice as 
many galaxies as previously analysed, and 60% more 
LRGs than used in Eisenstein et al. (2005). By opti- 
mally analysing all of the galaxies to calculate the un- 
derlying power spectrum, we obtain the most accurate 
determination of the redshift-space power spectrum ever 



The SDSS DR5 Galaxy Power Spectrum 



3 



obtained for any sample of galaxies. First, we will tost 
if the discrepancy between previous 2dFGRS and SDSS 
power spectra remains. Second, the size of the sample 
means that we can consider how well the galaxies trace 
the mass, by testing the simple hypothesis that the shape 
of the power spectrum matches linear CDM models over 
k < 0.15/iMpc~^, and looking for discrepancies over 
these scales. Finally, the number of galaxies and volume 
covered are now sufficient to accurately measure power 
spectra for subsamples, splitting the catalogue to test for 
general changes in the shape of the power spectrum as a 
function of galaxy properties. 

In a parallel paper, Tegmark et al. (2006) present an 
analysis of a largely overlapping data set, drawn from 
SDSS DR4, with a focus on the implications for multi- 
parameter cosmological model fits. There are a number 
of differences in the analysis methods. Tegmark et al. 
(2006) use a pseudo-Karhoenen-Loeve method (Vogeley 
& Szalay 1996; Tegmark et al. 1997) to estimate the real 
space galaxy power spectrum, using finger-of-god com- 
pression and linear theory to remove redshift-space dis- 
tortion effects. We use the Fourier method of Percival et 
al. (2004), which extends that of Feldman et al. (1994), 
to estimate the angle-averaged (monopolc) redshift-space 
galaxy power spectrum. We combine the LRG and main 
galaxy samples, while Tegmark et al. (2006) concentrate 
on the LRGs, after showing that they have a power spec- 
trum shape consistent with that of the main sample. 
In addition, the many technical decisions that go into 
these analyses, regarding completeness corrections, angu- 
lar masks, K-corrections, and so forth, were made inde- 
pendently for the two papers, and they present different 
tests for systematic uncertainties. Despite these many 
differences of detail, our conclusions agree to the extent 
that they overlap (this will be discussed in Tegmark et 
al. 2006), a reassuring indication of the robustness of the 
results. 

2. THE SDSS DR5 SAMPLE 

The Sloan Digital Sky Survey (SDSS; York et al. 
2000; Stoughton et al. 2002; Abazajian et al. 2003, 2004; 
Adelman-McCarthy et al. 2006a) is an ongoing survey 
using a 2.5m telescope (Gunn et al. 2006) to obtain 10^ 
square degrees of imaging data in five passbands u, g, r, 
i and z (Fukugita et al. 1996; Gunn et al. 1998). The 
images are reduced (Lupton et al. 2001; Stoughton et 
al. 2002; Pier et al. 2003; Ivezic et al. 2004) and cali- 
brated (Lupton et al. 1999; Hogg et al. 2001; Smith et al. 
2002; Tucker et al. 2006), and galaxies are selected in two 
ways for follow-up spectroscopy. The main galaxy sam- 
ple (Strauss et al. 2002) targets galaxies brighter than 
r = 17.77 (approximately 90 per square degree, with 
a median redshift z = 0.11); in this paper we use the 
DR5 sample (Adelman-McCarthy et al. 2006b) contain- 
ing 465789 main galaxies that meet our selection criteria. 
In a small subset of the data taken during initial survey 
operation, we set a conservative faint magnitude limit 
corresponding to r = 17.5, to avoid minor fluctuations in 
the survey depth. 

In addition to the main galaxy sample, we also select 
56491 cut-I and cut-II Luminous Red Galaxies (LRGs; 
Eisenstein et al. 2001). The selection of these galax- 
ies, based on r and i colours and going to a deeper 
r magnitude, adds approximately 15 galaxies per square 



degree in addition to the main sample, and extends the 
redshift distribution to z ~ 0.5. In our sample, 65032 
of the main galaxies are also targeted in the SDSS as 
LRGs, but only 21310 of these galaxies are intrinsically 
luminous with Mo.i^ < —21.8 (Mo.i^ is defined in Sec- 
tion 2.3). Our sample therefore consists of 77801 bright 
LRGs and 444479 other galaxies, giving 522280 galaxies 
in total. 

These galaxies and objects selected for SDSS spectro- 
scopic observation for other scientific programmes are as- 
signed to plug-plates using a tiling algorithm designed to 
ensure nearly complete samples (Blanton et al. 2003a). 
The spectra are good enough to allow redshifts to be 
obtained for almost all of the galaxies selected for obser- 
vation. A detailed review of the SDSS galaxy observing 
strategy and the main galaxy sample is given by Tegmark 
et al. (2004). 

In our investigation, we simultaneously analyse the 
main galaxy sample and the LRGs, therefore including 
correlations between the two data sets in addition to 
internal correlations within the individual subsamples. 
This combination is aided by the fact that the transi- 
tion from main galaxies to LRGs within the survey is 
smooth in terms of galaxy properties and c;xpccted bias. 
In this section we present an overview of the data and 
the techniques used to model the selection function of 
this combined sample. 

Our chosen analysis method uses an empirically de- 
termined model of luminosity-dependent (but scale- 
independent) bias to increase the accuracy with which 
the underlying power spectrum can be recovered, and 
correct for offsets in the measured power caused by such 
a bias (a description of the model is provided in Sec- 
tion 3). We apply two cuts to the final DR5 sub-sample 
that we analyse as a consequence of this model: we ex- 
clude low luminosity LRGs with Mo.i^ > —21.8, and high 
luminosity galaxies Mo.i,. < —23.0 (whether main galaxy 
or LRG) . The reasons for these cuts are presented in Sec- 
tion 3. 

Due to practical limitations of fibre positioning, spec- 
tra cannot be obtained for objects closer than 55 arcsec, 
within a single spectroscopic tile. This is mitigated to 
some extent by multiple observations of the same region 
where tiles overlap, but we choose to apply a further cor- 
rection; if a targeted main galaxy or LRG has no redshift 
and lies within 55 arcsec of another main galaxy or LRG 
with a redshift, then the observed redshift is assigned to 
both galaxies. Main galaxies (non-LRGs) and LRGs are 
treated separately; where a main galaxy or LRG is not 
assigned a fibre due to collision with a quasar or galaxy 
that is not in the same class (e.g. a main galaxy is ob- 
scured by a LRG), then no redshift is assigned. This 
is the procedure adopted by Zehavi et al. (2002) where, 
by comparison with just the plate overlap regions where 
fibre collision has a reduced impact, it was shown to pro- 
vide a sufficient correction for this eff'ect on the large- 
scales of interest in the current study. We apply a close- 
pair correction to 3132 LRGs and 19402 main galaxies, 
which form 4.3% of the total population. 

In certain regions observed early in the survey oper- 
ations, we exclude all main galaxies with r > 17.5, so 
the main galaxies form two populations with r < 17.5, 
or r < 17.77 depending on angular position. 49688 of 
the main sample galaxies (11.9%) lie in the regions lim- 
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Fig. 1. — Part of the SDSS DR5 sample region plotted as a 
function of celestial coordinates. Note that this is not an equal 
area projection, so the plate outlines (solid lines) do not form per- 
fect circles. Different colours delineate groups of pixels where the 
completeness of the survey is expected to be the same (see text for 
details). Black points show the positions of the galaxies. White 
regions are bad areas excluded from the survey mask. Where these 
areas are due to bad photometric fields, the regions often follow 
the drift scanning strategy of the photometric survey - hence the 
white stripes at o ~ 132°, <5 ~ 48°. 

ited to r < 17.5. We assume that the LRGs form a 
single population with an isotropic redshift distribution. 
The radial selection function of our combined sample can 
therefore be decomposed into three sub-samples with dif- 
ferent radial distributions - main galaxies with r < 17.5, 
main galaxies with r < 17 .77 , and LRGs. The limits 
of the survey are trimmed by setting redshift limits on 
the combined sample by only considering galaxies with 
0.003 < z < 0.5. LRGs not in the main galaxy sam- 
ple must have z > 0.15, and main sample galaxies must 
have z < 0.3. This removes regions where the selection 
function becomes small and poorly determined. 

2.1. Angular Selection 

The angular selection function of the SDSS galaxies 
was modelled using a routine based on a HEALPIX 
(Gorski et al. 2005) equal-area pixellization of the sphere. 
HEALPIX was used to decompose the sphere into 
3145728 pixels, each of size 0.013 deg^. This means that 
each SDSS plate is covered by 532 pixels, and the DR5 
sample analysed covers 487177 pixels. Given the large 
angular scales of interest, the effect of this pixellization 
should be negligible on the resulting power spectrum. 

The first task is to find groups of pixels that have the 
same spectroscopic targeting information - they cover 
regions selected in the same targeting run(s), covered by 
the same set of tiles, and are within the photometric 
SDSS survey region - i.e. not in a bad field. The SDSS 
photometric survey area is decomposed into fields of ap- 
proximate size 0.033 square degrees. Consequently, bad 
fields, as defined in the SDSS Catalogue Archive, often 
only cover part of a pixel, so for each pixel we have al- 
lowed the effective area to be reduced by bad fields rather 
than removing the whole pixel. There are small internal 
regions within the area covered by the SDSS imaging 



that are not covered by spectroscopic tiles, which only 
have a few possible target galaxies, if any at all. We 
have included such regions as separate groups of pixels 
in our analysis. In total, we decompose the survey region 
in 6447 distinct groups. Note that we use a full list of 
targeted plates to create this decomposition of the spher- 
ical surface rather than just the observed plates; it is the 
targeting algorithm (Blanton et al. 2003a) that decides 
the distribution of the spectroscopically observed galax- 
ies, so the distribution across observed plates will depend 
on the positions of the unobserved overlapping plates. 

An example region within the angular mask created 
for the survey is plotted in Fig. 1, showing the differ- 
ent groups. The staves (named for the similarly shaped 
planks in barrel making) within the survey, correspond- 
ing to photometric sections from different great circle 
scans, are clearly visible. The most obvious pattern in 
the decomposition of the angular mask is due to the 
overlap of different spectroscopic plates, which can leave 
some small regions containing only a handful of galax- 
ies. The boundaries to different targeting regions follow 
segments of staves, leading to additional divisions across 
different staves. This region was chosen as it includes 
a small area covered by bad fields that shows the inter- 
leaved drift-scan strategy of the SDSS photometric ob- 
servations, splitting the stave into 12 individual columns. 

Having created a list of target regions on our pixel- 
lated mask, we calculated the completeness within each 
group, the ratio of good quality spectra to targets. Any 
small internal group containing no targets was assigned 
a completeness of 1 . We have applied a completeness cut 
of 70% and exclude regions with a lower completeness. 
In general this only removes regions where the spectro- 
scopic survey is incomplete and further observations are 
scheduled for a particular group. For our final sample, 
97.6% of the galaxies targeted have rcdshifts with spec- 
troscopic confidence greater than 80%, as defined in the 
SDSS Catalogue Archive Server (CAS), after the fiber 
collision correction mentioned above. 

2.2. Radial Selection 

In order to fully quantify the expected distribution of 
galaxies, we obviously need to model the radial galaxy 
distribution. We do this in different ways for the main 
galaxies and LRGs. For the main galaxies, we have found 
that a simple fit of the form (Baugh & Efstathiou 1993) 



z^ exp 



(1) 



provides an adequate fit to the data, where g, b and Zc 
are fitted parameters. Fig. 2 shows the distribution of 
main galaxy redshifts in the SDSS DR5 sample compared 
with a fit of this form for apparent magnitude limits of 
r = 17.5 and r = 17.77; as discussed in Section 2.1, in 
some angular directions the effective magnitude limit had 
to be reduced to r = 17.5. For galaxies with r < 17.5, 
the best-fit model redshift distribution has Zc = 0.0955, 
b = 1.88, g = 1.35. For galaxies with r < 17.77, the 
best-fit value of z^ changes to Zc = 0.106. 

For the LRGs, we fit the radial number density using a 
cubic spline (Press et al. 1992) with nodes separated by 
Az — 0.05, although we add additional nodes to enable 
the fit to match the distortions at z ~ 0.3, 0.34, 0.44, 
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Fig. 2. — Radial distributions of the SDSS DR5 main galax- 
ies (solid circles) to an apparent r magnitude limit of 17.5 (lower 
panel), and 17.77 (middle panel). The sharp increase in the num- 
ber of galaxies at z = 0.08 is predominantly the effect of the "Sloan 
Great Wall" (Gott et al. 2005). These data arc fitted using equa- 
tion (1), shown by the solid lines (Baugh & Efstathiou 1993). In 
the upper panel we show the distribution of the number density 
of the LRGs (excluding those in the main galaxy sample). These 
data are fitted with a smooth cubic spline (solid lines) with nodes 
selected to allow the curve to fit the sharp distortions caused by 
spectroscopic features moving through the SDSS filters used to se- 
lect the LRGs, and the mixture of cut-I and cut-II LRGs (see text 
for details). 

caused by spectroscopic features moving through the 
SDSS filters used to select the LRGs, and the join be- 
tween cut-I and cut-II LRGs. At the higher redshifts 
probed by the LRGs, the effect of clustering on the red- 
shift distribution diminishes, and a spline fit is less likely 
to remove structure compared with lower redshift data. 

2.3. Luminosities 

Where specified, we have K-corrected the galaxy lu- 
minosities using the methodology outlined in Blanton 
et al. (2003a,b). In particular, we have used the 
kcorrect_v4_l_4 software package using the observed 
M,g,r,i,z Galactic extinction-corrected (using the maps of 
Schlegel et al. 1998) Petrosian magnitudes from the DR5 
CAS archive and their measured errors. We have used 
the "BEST" database to obtain the galaxy magnitudes, 
and the "TARGET" database to obtain the list of galaxy 
targets. We also use the same z = 0.1 shifted r-band 
filter to define our luminosities (as discussed in Blan- 
ton et al. 2003b), which wc refer to as Mo i ,, throughout 
this paper. Galaxy luminosities without K-corrections 
are written as M^.. We use the same method for both 
the LRG sample and the main SDSS galaxy sample, and 
have not added evolutionary corrections. We have as- 
sumed Qm = 0.3 and fl\ = 0.7 for computing these 



Fig. 3. — The colour of 5% of the main galaxies in the SDSS 
DR5 sample, selected at random. The well-known bimodal split 
between red and blue galaxies is clear, approximately split by 
Mo.ig — Mo.i^ = 0.8 (dashed line). This plot highUghts the fact 
that as we change the magnitude of the samples, we also change 
the average colour, with the more luminous main galaxies being 
predominantly redder. 

K-corrections and applied recommended AB corrections 
to the observed SDSS magnitude system (Smith et al. 
2002). 

2.4. Colours 

The average colours of galaxies change as the luminos- 
ity increases. Fig. 3 shows the Mo.i^ — Mo.i^ colour dis- 
tribution of these galaxies, plotted as a function of Mo.i^. 
The well-known bimodal red-blue split in galaxy colours 
(Strateva et al. 2001; Baldry et al. 2004) is evident, and 



can be approximately delineated by Mo.i, 



■Mo 



0.. 



Importantly, for our development of a bias model for 
these galaxies, as we change the magnitude wc also 
change the colour of our sample, with more luminous 
galaxies being predominantly redder. We wiU see in the 
next section that this means that there is a smooth tran- 
sition between the main galaxy and LRG samples. 

3. MODELLING SCALE-INDEPENDENT GALAXY BIAS 

For a magnitude limited survey, on average, the largest 
scales will be probed by the most luminous galaxies. Lu- 
minous galaxies are known to be more biased than less- 
luminous galaxies (e.g. Park et al. 1994), so this relative 
bias needs to be quantified in order to minimise any sys- 
tematic effects caused by comparing different types of 
galaxies on different scales. For the main galaxy sample 
we analyse subsamples with different average luminos- 
ity, in order to measure the relative bias (Section 3.1). 
The LRG sample is now sufficiently large that we can 
also consider splitting this catalogue into subsamples as 
a function of luminosity (this is discussed in Section 3.2). 

3.1. Main Galaxies 

In order to empirically determine the relative cluster- 
ing strengths of the main galaxy sample as a function of 
luminosity, we split the catalogue based on the absolute 
magnitude without K-correction or correction for evolu- 
tion in the luminosity function, and the corresponding 
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Fig. 4. — The redshift distributions of two of our pseudo- volume 
limited catalogues (solid lines). The selection of these catalogues 
was based on absolute magnitudes without K-corrections or correc- 
tions for evolution, and so arc not strictly volume limited. These 
effects were instead included in estimating the redshift distribution 
of the samples to create matched random catalogues (dotted lines; 
see text for details). For comparison, the dashed curves show the 
redshift distribution that the catalogues would have if they were 
strictly volume limited - as can be seen these are a poor fit to the 
data. 

redshift limits where the sub-catalogue is complete (see 
Fig. 5). We analyse 8 catalogues of width AM^ = 0.5 
with —22.0 < Mr < —18.0, and an additional cata- 
logue of bright galaxies with —23 < < —22. We 
call these catalogues pseudo- volume limited, as they arc 
not strictly volume limited because we have ignored K- 
corrections and evolution in the luminosity function. In 
order to estimate the redshift distribution of each sam- 
ple, we determine the average K-correction as a function 
of redshift, and use this, together with the best-fit evolu- 
tionary corrections of Blanton et al. (2003b), to predict 
the expected galaxy number density. The redshift distri- 
butions for two of our pseudo- volume limited catalogues 
are compared with the modelled distribution in Fig. 4, 
where good agreement is demonstrated. Obviously, we 
can still calculate the average absolute magnitude includ- 
ing K-corrections for each sample, and it is this magni- 
tude that we use to parametrise our bias model. 

The distribution of absolute magnitude (without K- 
corrcction) against redshift for the SDSS DR5 sample 
of galaxies is presented in Fig. 5. We plot against 
redshift for 5% of the galaxies in the combined sample, 
randomly selected. The main galaxy volume limited cat- 
alogues are delineated by the boxes in this plot, and the 
upper and lower apparent magnitude limits of the main 
galaxy survey are also shown. 

For each pseudo- volume limited catalogue we have cal- 
culated the power spectrum using the same method ap- 
plied to our final catalogue in Section 4, with the excep- 
tion that a uniform bias model was applied - we treat all 
galaxies as having h — 1, so the relative amplitudes of the 
recovered power are not affected by an input bias model. 
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Fig. 5. — The distribution of the SDSS galaxies in the redshift — 
luminosity plane. Absolute magnitudes were calculated assuming a 
flat A cosmology with Clj^ = 0.3 and have not been K-corrccted or 
corrected for evolution. The upper and lower apparent magnitude 
limits of the main galaxy sample are shown by the dashed lines. 
The redshift and magnitude limits of the pseudo-volume limited 
catalogues analysed to calculate the relative bias as a function of 
absolute magnitude are shown by the overlaid rectangles. 

Some of these power spectra arc plotted in Fig. 18. The 
window function for each sub-catalogue was calculated 
as for our final catalogue, and we have fitted the power 
spectrum amplitude over 0.01 < k < 0.2/iMpc~^ using 
window- convolved models with approximately the cor- 
rect large-scale shape. The power spectra recovered from 
the different subsamples are correlated, and we have not 
estimated their relative errors, as would be strictly re- 
quired when comparing their relative amplitudes. In- 
stead, we simply measure the average and standard de- 
viation of the difference between measured and model 
power spectra over the range of scales of interest, and 
calculate the bias from this. Consequently, the biases are 
not optimally determined, and their errors do not include 
the effect of cosmic variance for the regions of the cata- 
logues that do not overlap, and will probably therefore 
under-estimate the true error. However, as wc show in 
test (5) of Section 7, we do not need to know the relative 
biases to high precision as the resulting power spectrum 
is not sensitive to the exact form of this correction. 

The relative biases measured from the pseudo-volume 
limited catalogues are plotted (solid circles) in Fig. 6 
as a function of the average K-corrected absolute mag- 
nitude. The smooth shape matches the overall shape 
of the simple formula of Norberg et al. (2001) 6/6* = 
0.85 + 0.15L/L., (solid line), with = -20.44 (Blanton 
et al. 2003b). However, the addition of an extra term 
6/6* = 0.85 + 0.15L/L* + 0.04(M* - Mo.i^) (Tegmark et 
al. 2004; Zehavi et al. 2005b) allows for a sharper increase 
in bias with luminosity, which is a better fit to the data. 

When measuring the power spectrum from the final 
combined galaxy catalogue (see Section 4 for a descrip- 
tion of the method), we only need to know the averaged 
properties of the expected bias at each spatial location. 
Consequently, even though wc expect galaxy bias to de- 
pend on colour as well as luminosity, a simple luminosity- 
bias relation can still be used provided that the cata- 
logues from which the relation is derived contain the 
same distribution of galaxy colours in each luminosity 
bin as galaxies of that luminosity in the combined galaxy 
sample. For our pseudo- volume limited subcatalogues of 
the main sample galaxies, we only exclude galaxies that 
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Fig. 6. — The bias of the SDSS main galaxies (sohd eircles with 1- 
a errors) and LRGs (open triangles with 1-cr errors) as a function 
of Mo.i,,. Horizontal errors show the range of luminosities anal- 
ysed in each sample (ignoring K-corrcctions), while the data point 
is placed at the weighted average K-corrccted luminosity. The re- 
lation suggested by Norberg et al. (2001), 6/6* = 0.85 -|- 0.15L/Lt, 
where M* = —20.44 (Blanton et al. 2003b) is shown by the solid 
line. The dashed line shows the alternative formula of Tegmark et 
al. (2004), predicting a steeper increase in bias for luminous galax- 
ies. As we are only interested in the relative normalisation of the 
bias, we allow 6* to vary to fit the data for the different formulae, 
meaning that the curves do not cross at L = L*. 

lie beyond the redshift limits applied for each catalogue 
(See Fig. 5). Because the catalogues are relatively nar- 
row in magnitude, AM^ = 0.5, they contain between 
67% and 77% of the total number of galaxies within the 
chosen magnitude limits (this number fluctuates because 
of the changing steepness of the mimbcr counts shown in 
Fig. 5). Consequently, each sub-catalogue should have 
approximately the same galaxy colour distribution as 
all galaxies of that luminosity in the main sample. At 
high luminosities, the galaxies are predominantly LRGs 
as shown in Fig. 3, while at low luminosities they are 
formed of a mix of red and blue galaxies. 

3.2. LRGs 

The DR5 LRG sample is of sufficient size that the clus- 
tering can now be measured as a function of luminosity 
as for the main galaxy sample. The distribution of LRG 
magnitudes is plotted against redshift in Fig. 5, show- 
ing that the LRGs form a natural extension of the main 
galaxy sample to higher redshifts. However, given the 
more complicated selection function of the LRGs, mod- 
elling the redshift distribution is not as straightforward 
as for the main galaxy sample. Consequently, for sub- 
catalogues of LRGs selected as a function of luminosity, 
we have fit the number density as a function of redshift 
with the cubic spline fit as described in Section 2.2 for 
the total LRG sample. Example fits are plotted for two 
of our LRG sub-catalogues in Fig. 7. Because of the in- 
creased volume covered by the LRGs, although there are 




redshift 

Fic;. 7. — The number density as a function of redshift of two 
of our magnitude selected LRG catalogues (solid histograms). The 
distribution is fitted by a cubic spline (smooth curve) with the 
positions of the nodes (crosses) determined as described in the 
text. 

relatively few galaxies, the effect of cosmological struc- 
ture is small, and we do not expect to remove power by 
fitting to the redshift distribution in this way. 

The relative bias measurements for the LRGs were cal- 
culated from these subsamples as for our main galaxy 
pseudo-volume limited catalogues, and are plotted in 
Fig. 6 compared with the main galaxy sample measure- 
ments. As can be seen, the bias increases with luminosity 
for the LRGs as for the main galaxies, and the high lumi- 
nosity data match the simple formula of Tegmark et al. 
(2004). We do sec; a difference between LRG and main 
galaxy bias at low luminosities, because the colour cuts 
applied to select these LRGs will remove blue galaxies 
that are in the main galaxy sample (see the colour distri- 
bution plotted in Fig. 3), matching the findings of Zehavi 
et al. (2005a). This colour dependence is not included in 
our bias model, so we simply exclude the lowest LRG lu- 
minosity bin from our final combined sample, removing 
LRGs with Mo.i^ > —21.8. We also exclude all high lumi- 
nosity galaxies Mo.i^ < —23.0 (whether main galaxy or 
LRG), where an expected high bias would have a strong 
effect on the power spectrum. 

We calculate the relative bias of galaxies using the K- 
corrected absolute magnitudes, assuming a flat A cos- 
mology with JIm = 0.3. The large-scale relative am- 
plitudes of the power spectra of our galaxy subsamples 
are only weakly dependent on the model chosen to con- 
vert from redshift to comoving distance. Consequently, 
even though we may assume a different cosmology when 
analysing the full galaxy sample, we choose to still use 
the bias as a function of K-corrected absolute magnitudes 
calculated assuming a flat A cosmology with = 0.3. 
We do not change the bias model as a function of cos- 
mology. 

4. MEASURING THE POWER SPECTRUM 
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1000 



pair separation / h Mpc 

Fig. 8. - Weighted pair counts from LRG-LRG (solid line), 
niairi-inain (dotted line), and LRG-rnain (dashed line) pairs. This 
plot shows how well the correlation function can be measured as 
a function of scale from these different samples, and is directly re- 
lated to the power spectrum measurement at a particular scale (e.g. 
Tegmark ct al. 2006). The LRG-LRG pairs dominate the measure- 
ment on all scales, although there is a significant contribution from 
pairs formed of a LRG and a main sample galaxy. 




0.01 

wave number k / h Mpc~^ 

Fig. 9. — Cubic spline fits to the spherically averaged window 
functions in Fourier space (solid lines) with nodes given by open 
circles for the SDSS DR5 LRG subsample, solid circles for the 
main galaxy subsample and solid triangles for the combined LRG 
and main galaxy sample. For clarity, we only plot the measured 
window function power for the combined sample (crosses). In order 
to highlight small scale structure caused by the inhomogeneous 
nature of the survey, we multiply |G(fc)p by k^. 



The SDSS galaxy power spectrum has been calculated 
using the Fourier based method of Percival et al. (2004). 
Given a model for the average relative bias at each lo- 
cation in the survey, this method extends the method of 
Feldman et al. (1994) to remove the effect of differential 
bias from the recovered power spectrum. Additionally, 
this average bias is used to optimally weight the galaxies 
- more luminous galaxies are stronger tracers of the un- 



derlying density field, contain more information about 
the fluctuations, and therefore should be up-weighted 
compared with less luminous galaxies. 

The Percival et al. (2004) method was used by Cole 
et al. (2005) to analyse the distribution of galaxies in 
the 2dFGRS, but there are some differences between our 
analysis and that of Cole et al. (2005) , so we now provide 
a brief description of the method applied to the SDSS 
highlighting these differences. The primary change is 
that, because the sample of galaxies extends to redshift 
0.5, we have dropped the common assumption of using 
a single cosmological model in order to convert redshifts 
to comoving radial distances. Instead, we recalculate the 
power spectrum for different flat A cosmological models. 
We will sometimes wish to distinguish these cosmologies 
from those used to create model power spectra. When we 
do this, we will refer to the model used to calculate co- 
moving distances as the "cosmological distance model" . 
Consequently, the data do not compress into a single 
power spectrum, and there is no single power spectrum 
resulting from our analysis. 

At its heart, the Fourier method provides the simplest 
way to calculate something approximating the galaxy 
power spectrum. The galaxies are decomposed onto a 
grid, the grid is Fourier transformed, and the amplitude 
of the Fourier modes measured. One key complication 
is that weights are applied to each galaxy to optimally 
reduce the error in the recovered power. Assuming a 
model for the galaxy bias, the weights applied to a galaxy 
at location r with expected bias 6' are those derived in 
Percival et al. (2004), 



w{r, b') 



{b'nr)P{k) 



1 + Jdb{n{r, b))b'^P{k)' 



(2) 



where P{k) is an estimate of the (unbiased) power spec- 
trum, and (n(r, b)) is the expected density of galaxies as 
a function of space and_bias. In the analysis presented in 
this paper we assume P{k) = constant = 5000 ft'^Mpc^'"', 
for simplicity. This does not have a strong effect on the 
accuracy of the recovered power spectrum. The relative 
weighted contributions from pairs of LRGs, main galaxies 
and LRG-main galaxy pairs are plotted as a function of 
scale in Fig. 8. As can be seen, although the LRGs dom- 
inate the analysis on all scales there is a significant con- 
tribution to the weighted pair counts from main galaxy- 
LRG pairs. Had we allowed P{k) to be reduced on small 
scales (as would be optimal), then the higher density re- 
gions would contribute more to the weighted pair counts 
and the main sample galaxies would have dominated the 
distribution on these smaller scales. 

Given a weight Wi and expected bias bi for each galaxy, 
the overdensity field can be written 
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where TV is a normalisation constant 

N=ljd^r J db {w{r, b)n{r, b)) 



1/2 



(3) 



(4) 



and {w{r, b)n{r, b)) is the expected weighted distribution 
of galaxies as a function of bias b and location r. We 
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Fig. 10. — Top panel: correlations between data values 
calculated from 2000 log-normal catalogues, assuming a flat 
A cosmology with Qm = 0.24 for the cosmological distance 
model. Denoting the covariance between two power spectrum 
values P(fei) and P{kj) as coy{ki,kj) = {P{ki)P{kj)), then 
we plot the correlation between the two measurements given by 
cov{ki,kj)/[cov{ki,ki)cov{kj,kj)]^/^. For presentation, we have 
calculated the correlations plotted after matching the power spec- 
tra amplitudes recovered from the log-normal catalogues. This 
removes any normalisation error and only shows correlations in- 
duced by the window function. The covariance matrices used to 
calculate likelihoods were calculated from the raw power spectra 
calculated from the log-normal catalogues and therefore include the 
error in the overall normalisation. For 0.01 < A; < 0.15/iMpc~^, 
we see that the correlation between data points is < 0.33 for 
\ki — kj\ > 0.01 Mpc^-' . Bottom panel: The normalised win- 
dow function for each of our binned power spectrum values with 
0.01 < k < 0.15 Mpc^-' . Each curve shows the relative contribu- 
tion from the underlying power spectrum as a function of k to the 
measured data values plotted in Fig. 12. The normalisation is such 
that the area under each curve is unity. 

choose to model this field using a random catalogue with 
points selected at the mean galaxy density (n(r)), using 
the fits outlined in Section 2.2. We use a random cat- 
alogue containing 10 times as many points as we have 
galaxies. Because wc only need to determine the inte- 
gral of the average of the weighted and bias corrected 
density, we do not need to assign a luminosity to each 
point in the random catalogue and calculate a bias from 
this. Instead, we simply calculate weights and biases for 
the random catalogue by fitting the average radial values 
of Wi and Wi/hi in the galaxy catalogue as a function of 
redshift using cubic splines. The weights in the random 
catalogue are renormalised, compared with the weights 
applied to the galaxies so that / F{y) dr = 0, thereby 
matching the total weighted number density in galaxy 
and random catalogues. 

The statistic that we use to compare models to the 
data is the shot noise subtracted power spectrum of the 
overdensity field F{r), measured in a series of fc-space 
bins. The expectation value of this statistic is 



|i^(k)|2 - P,hot) 



-! 



:P(k')|G(k-k')P, (5) 




Fig. 11. — The amplitudes of three of the diagonal covariance 
matrix elements, estimated for fiat A cosmological models with 
different Qm, from 2000 log- normal catalogues. These are plotted 
as a function of Sljvf with AQm = 0.01. The noise in the individual 
calculations is clear; this scatter has been minimised by fitting the 
data with a smooth cubic spline fit shown by the solid lines. We do 
not plot the error on the covariance matrix (the error in the error) 
for clarity. 

shot noise is 
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where we average |i^(k)|^ over all fc-space directions. The 



and the window function |G(k)|^ is the normalised power 
in a Fourier transform of 

G(r) = y db{n{r,b)w{r,b)). (7) 

The spherically averaged window functions from the 
LRGs, the main galaxies and the combination of main 
galaxies and LRGs from the SDSS DR5 sample are com- 
pared in Fig. 9. The large volume probed by the LRGs 
means that the /c-space window is small compared with 
that calculated for the main galaxies. The window func- 
tion from the combined LRG and main galaxy sample 
is close to that of the LRGs, although the main galax- 
ies do provide more pairs of galaxies at intermediate 
scales, smoothing the structure within the window func- 
tion. Spline fits to the window functions are used to nu- 
merically determine the effect of the window on a model 
power spectrum. For a smooth power spectrum, the fea- 
tures in the window function are relatively unimportant 
compared with the overall shape. The correlations in- 
duced by the window for the combined main galaxy and 
LRG sample on the binned power spectrum are plotted 
in the bottom panel of Fig. 10. 

The recovered power spectrum values are assumed to 
be distributed as a multi-variate Gaussian, and we es- 
timate the covariance matrix of this Gaussian distribu- 
tion using log-normal catalogues (Coles & Jones 1991). 
For each of 31 flat cosmological distance models with 
0.1 < riM < 0.4 and AflM = 0.01, we have created 
2000 log-normal catalogues (using the method described 
in Cole ct al. 2005). The distribution of galaxies in these 
catalogues was calculated using the appropriate cosmo- 
logical distance model, while the power spectrum was cal- 
culated using a linear CDM model (see Section 5) with 
parameters chosen to approximately match the ampli- 
tude and shape of the recovered power for 0.01 < k < 
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Fig. 12. — The rcdshift-space power spectrum recovered from the combined SDSS main galaxy and LRG sample, optimally weighted for 
both density changes and luminosity dependent bias (solid circles with l-cr errors). A flat A cosmological distance model was assumed with 
Qm = 0.24. Error bars are derived from the diagonal elements of the covariance matrix calculated from 2000 log-normal catalogues created 
for this cosmological distance model, but with a power spectrum amplitude and shape matched to that observed (see text for details). 
The data are correlated, and the width of the correlations is presented in Fig. 10 (the correlation between data points drops to < 0.33 for 
Afc > 0.01 /iMpc"^). The correlations are smaller than the oscillatory features observed in the recovered power spectrum. For comparison 
we plot the model power spectrum (solid line) calculated using the fitting formulae of Eisenstein & Hu (1998); Eisenstein et al. (2006), for 
the best fit parameters calculated by fitting the WMAP 3-year temperature and polarisation data, h = 0.73, Qm = 0.24, ris = 0.96 and 
Qh/^M = 0.174 (Spergel et al. 2006). The model power spectrum has been convolved with the appropriate window function to match the 
measured data, and the normalisation has been matched to that of the large-scale (0.01 < k < 0.06 /t Mpc^-') data. The deviation from 
this low Qm linear power spectrum is clearly visible at > 0.06 /i Mpc^-' , and will be discussed further in Section 6. The solid circles with 
1(T errors in the inset show the power spectrum ratioed to a smooth model (calculated using a cubic spline fit as described in Percival et al. 
2006) compared to the baryon oscillations in the (WMAP 3-year parameter) model (solid line), and shows good agreement. The calculation 
of the matter density from these oscillations will be considered in a separate paper (Percival et al. 2006). The dashed line shows the same 
model without the correction for the damping effect of small-scale structure growth of Eisenstein et al. (2006). It is worth noting that this 
model is not a fit to the data, but a prediction from the CMB experiment. 
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0.15/1 Mpc . As we will sec in Section 6, this means 
that the model power is lower than the power calculated 
from the data on scales k < 0.02/iMpc~^, so the errors 
calculated from the log-normal catalogues are probably 
slightly overestimated on these scales. The diagonal ele- 
ments of the covariance matrix calciilated in this way are 
based on the effects of cosmic variance and shot noise. 
The off-diagonal covariance matrix elements include both 
the effect of the P{k) window functions (bottom panel 
of Fig. 10) and the mode coupling induced by non-linear 
evolution, to the extent that the latter is adequately de- 
scribed by the log-normal approximation. The correla- 
tions induced by these effects are shown in the top panel 
of Fig. 10. 

The noise in a single covariance matrix element would 
not normally be noticed when calculating parameter con- 
straints, as it would affect all models in the same way. 
However, when we use different covariance matrices to 
test different cosmological models these errors can be- 
come important. Interestingly, if the power spectrum val- 
ues have a Gaussian distribution, then our estimates of 
the elements of the covariance matrix will be drawn from 
a Wishart distribution, the same distribution formed by 
temperature and polarisation CMB power spectra in an 
all-sky survey (Percival & Brown 2006). We minimise the 
effect of this noise by smoothing each element in the set of 
covariance matrices using a separate 4-nodc cubic spline 
with nodes at ^Im = 0.1,0.2,0.3,0.4. Three examples of 
this fit are plotted in Fig. 11, for the diagonal elements 
of the covariance matrix at fc = 0.05, 0.1, 0.15 /i Mpc"'^. 
There is a clear general trend that the expected error 
in the power increases with the matter density. This is 
caused by the changing comoving distance-redshift rela- 
tion, which means that a smaller volume is predicted for 
the survey assuming Qm = 0.4, rather than Qm = 0.1. 

The Fourier transforms used in this paper were per- 
formed on 512'^ grids with a varying box size, where we 
only consider modes that lie between 1/4 and 1/2 of the 
Nyquist frequency for each box. The smoothing effect of 
the galaxy assignment is corrected as described in Cole 
et al. (2005). We have compared with both larger 1024^ 
and smaller 256'^ Fourier transforms, and find no evi- 
dence for systematics induced by the practicalities of the 
Fourier transforms. 

Assuming a flat cosmological distance model with 
fJjvf = 0.24, matched to the parameters recovered from 
the 3-year WMAP CMB data, the recovered SDSS power 
spectrum is plotted in Fig. 12. Because of the bias- model 
correction, the normalisation of this power spectrum is 
matched to that of galaxies, where M* = —20.44 
(Blanton et al. 2003b). The precision with which this 
power spectrum is measured, particularly on large scales, 
is impressive. In Fig. 12, we also plot the linear matter 
power spectrum predicted from the best-fit WMAP pa- 
rameters, normalised to match the data on scales 0.01 < 
k < 0.06/iMpc~^. This shows good agreement in the 
shape of the power spectrum on these scales, but deviates 
on scales k > 0.06/iMpc~^. Significant non-linear cor- 
rections to the matter power spectrum are only expected 
for k > 0.15 /iMpc"^ (Smith et al. 2003), and large- 
scale redshift space distortions are commonly treated as 
being scale-independent on these large scales. Conse- 
quently, it has been common practice to assume that the 
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Fic;. 13. — The recovered SDSS DR5 power spectrum plotted in 
Fig. 12 compared with the previous SDSS real-space Main galaxy 
power spectrum calculated for the DR3 sample (from Table 3 of- 
Tegmark et al. 2004), and the 2dFGRS redshift-space galaxy power 
spectrum of Cole et al. (2005). The data were corrected for the 
effects of the different window functions by calculating the mul- 
tiplicative effect on a theoretical model with approximately the 
correct power spectrum shape. The normalisation of the previous 
data have been matched to that of the new power spectrum on 
large scales k < 0.06/iMpc~^. The shape of the power spectra 
agree well on these scales, but predict different power spectrum 
amplitudes on smaller scales. The solid and dashed lines show two 
linear CDM model power spectra, plotted as in Fig. 16. 

shape of the power spectrum recovered from galaxy sur- 
veys matches the linear matter power spectrum shape on 
scales k < 0.15 /iMpc"^ (Percival et al. 2001; Tegmark 
et al. 2004). However, the SDSS power spectrum has 
greater power on small scales than the power spectrum 
predicted by the 3-year WMAP data. 

In Fig. 13, we compare the recovered power spectrum 
with previous power spectra calculated from the SDSS 
DR3 main galaxy sample (Tegmark et al. 2004) and 2dF- 
GRS (Cole et al. 2005). The error bars on the DR5 data 
points are much smaller than those on the DR3 data 
points in part because of the greater sky coverage of DR5, 
and in (greater) part because of the inclusion of LRGs 
in the sample. It is also worth noting that the procedure 
used by Tegmark et al. (2004) constructs P{k) estimates 
that are nearly independent, while our estimates are cor- 
related as shown in Fig. 10. After corrections for the 
window functions and differences in the overall normal- 
isation, we see that the large-scale (k < 0.06/iMpc~^) 
shape of the power spectrum recovered from the SDSS is 
well matched to that recovered from the complete 2dF- 
GRS. On smaller scales {k > 0.06 /iMpc"^), our SDSS 
power spectrum of the combined main galaxy and LRG 
sample reveals more power than that recovered from the 
2dFGRS, while the previous power spectrum recovered 
from the SDSS DR3 main galaxy sample lies between 
the two. These differences will be discussed further in 
Section 9. The power spectrum analysis of the SDSS 
DR4 LRGs by Huetsi (2006) also showed a consistent 
power spectrum shape, although the current analysis su- 
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persedes this in a number of ways. We analyse a larger 
sample of galaxies and include an average bias model to 
correct for the fact that we're analysing a mixed sample 
of galaxies. We have also carefully constructed the an- 
gular mask of the survey, and have provided a number of 
tests of the analysis method. Consequently we do not di- 
rectly compare with the power spectrum derived in this 
work. In the next section we analyse the shape of the 
recovered SDSS DR5 power spectrum in detail, looking 
at the constraints it provides on the matter density. 

The 31 sets of power spectra, window functions, 
and covariance matrices for the combined main-LRG 
sample, each computed using a different ^Im for cos- 
mological distance calculations can be obtained from 
http://www.dsg.port.ac.uk/~percivalw/, or upon re- 
quest from WJP. To use these data to compare the rel- 
ative likelihood of two model power spectra, one should 
choose for each model the data set with the closest 
riM value, convolve the model P(fc) with the window 
functions provided, and calculate the likelihood for each 
model from the tabulated P{ki) as a multi-variate Gaus- 
sian using the corresponding covariance matrix. 

5. MODELLING THE GALAXY POWER SPEGTRUM 

We calculate linear CDM model power spectra using 
the fitting formulae of Eisenstein & Hu (1998), including 

a correction for the damping of the baryon oscillations 
due to low-redshift small-scale structure (Eisenstein et 
al. 2006). The damped power spectrum Pdamped(^) is 
assumed to be given by 

Pdampedik) = gP\in{k) ^ {I - ff)P(fc)smooth, (8) 

where Pii„ is the standard linear power spectrum, and 
-P(fc)smooth is the same power spectrum without the 
baryon acoustic oscillations. The parameter g = 
exp(— fc^CT^/2) is a Gaussian damping term, and we 
assume a = 10/i~^Mpc for the spherically averaged 
redshift-space power spectrum that we measure (Eisen- 
stein et al. 2006). 

There are three effects that distort the observed galaxy 
power spectrum from the linear matter power spectrum. 
These are non-linear structure growth, redshift-space dis- 
tortions and galaxy bias. Note that we take non-linear 
structure growth to correspond to the overall behaviour 
of the matter in the universe, rather than the effect 
of the collapse of small-scale structures on the galaxy 
power spectrum, which may be different, depending on 
how galaxies trace the mass. In this paper we will only 
be concerned with scales where the matter in the uni- 
verse is still expected to be predominantly in the lin- 
ear regime k < 0.15/;, Mpe^^ (Smith et al. 2003), so the 
effect of non-linear structure growth is small. For the 
best-fit 3-year WMAP parameters (Spergel et al. 2006), 
the Smith et al. (2003) fitting formulae predict an in- 
crease in power at = 0.15 /iMpc^^ due to non-linear 
effects of 8%. Increasing the value of erg from 0.77 to 
0.9 predicts a 10% increase due to non-linear effects at 
k = 0.15/iMpc~^. In contrast, if the linear power spec- 
trum predicted by the WMAP experiment is normalised 
to the data on scales 0.01 < k < 0.06/iMpc~^, then the 
measured SDSS power spectrum is 40% greater than this 
model at fc = 0.15 /iMpc"^ 

While redshift space distortions only depend on the 
distribution of mass - the galaxies effectively act as test 



particles within the gravitational potential, the strength 
of galaxy bias is predicted to depend on galaxy proper- 
ties. In particular, there is some theoretical work that 
suggests that we might expect a scale-dependent galaxy 
bias, even on scales k < 0.15 hMpc~^. Seljak (2001) sug- 
gest that within the halo model (Seljak 2000; Peacock & 
Smith 2000; Cooray & Sheth 2002), there are two ef- 
fects that may cause a scale-dependent bias for pairs of 
galaxies in different halos, which is particularly strong for 
LRGs (where the galaxies only occupy the most massive 
halos) : if there is an additional Poisson selection of ha- 
los to be populated, then there may be a Poisson term in 
the resulting power spectrum in addition to the standard 
shot noise term due to the finite number of galaxies (that 
is subtracted in the Feldman et al. 1994 method). Such 
a Poisson term would show up on large scales where the 
power spectrum has a lower amplitude. Additionally, for 
the most massive halos, the bias is a strong function of 
the halo mass. Small changes in the average mass of the 
halos occupied by the galaxies as a function of the scale 
probed could lead to a scale dependent galaxy bias. 

Using halo model calculations. Cole et al. (2005) in- 
troduced a simple two parameter model (hereafter the 
Q-model) to correct for the effects of galaxy bias and 
redshift space distortions, with 

1 + Ak ^ ' 

where A = lA and Q = 4.0 was suggested for the 
redshift-space 2dFGRS power spectrum by fitting halo 
model simulations. For the real-space power spectra, 
halo model simulations instead predicted that A = 1.7 
and Q = 9.6. On large scales, the different values 
of A mean that the redshift-space distortions increase 
the power spectrum slope (so the ratio of small-scale 
to large-scale power is higher). On small-scales the 
converse is true, and the redshift-space distortion de- 
crease in the amplitude of the small-scale power. Be- 
tween 0.01 < k < 0.15/iMpc~^, if the model power 
spectra are normalised on large scales, then this model 
predicts a 7% lower redshift-space power spectrum at 
k = 0.15 /i Mpc""'^ compared with the real-space power 
spectrum. This is very similar to the offset between lin- 
ear and non-linear power discussed above using the Smith 
et al. (2003) fitting formulae. We see that the real-space 
power spectrum for the 2dF galaxies predicted by the 
halo model tracks the non-linear increase in power, while 
the redshift-space distortions effectively cancel the non- 
linear increase in power leading to a redshift-space galaxy 
power spectrum shape that more closely matches that of 
the linear matter power spectrum. 

The two parameter Q-modcl is used by Tegmark et al. 
(2006) in their analysis of the SDSS LRG power spec- 
trum; here A = 1.7 was fixed (corrections for redshift- 
space effects are included in the method), but Q was 
allowed to vary to fit the data, with a best-fit value of 
Q ~ 26 determined from the power spectrum fits in this 
work. The different values of Q for the SDSS LRGs and 
2dFGRS galaxies reflect a change in shape in the power 
spectrum, with the LRGs being more biased on small 
scales relative to large scales. Obviously, in addition to 
this change in shape, the SDSS LRGs are more biased 
at any scale than the 2dFGRS galaxies. With the val- 
ues of A and Q appropriate for the LRGs, the Q-model 
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Fig. 14. — Likelihood contours calculated from fitting the power 
spectrum shape with CDM models. Ug = 0.96, and h = 0.73 
are assumed and we vary Qm and the baryon fraction. For each 
value of Qm tested, we recalculate the power spectrum, the win- 
dow function and the expected error on the power so the cosmolog- 
ical distance model matches the power spectrum model. Dashed 
contours were fitted for 0.01 < k < 0.06/iMpc~^, and solid con- 
tours for 0.01 < k < 0.15/iMpc^^. Contours are plotted for 
— 21n£ = 1.0, 2.3, 6.0, 9.2, corresponding to one-parameter con- 
fidence of 68%, and two-paxameter confidence of 68%, 95% and 
99% for a Gaussian distribution. 




Fig. 15. — As Fig. 14, but now only considering a single power 
spectrum, calculated assuming a flat Qm = 0.24 cosmology to 
determine comoving galaxy distances from the measured galaxy 
redshifts. 

correction to the Hncar power spectrum varies between 
0.98 and 1.01 over 0.01 < fc < 0.06/iMpc"\ and in- 
creases on smaller scales to 1.31 at fc = 0.15 /iMpc~^. 
It is worth noting that, while the Q-model was not de- 
signed for such highly biased populations, the modelling 
presented in Tegmark et al. (2006) suggests that it might 
still fit the clustering of these luminous galaxies. 

Rather than assume such a prescription when fitting 
models to the data in this paper, we take a step back and 
instead consider the observational evidence that such a 
correction is required. First, wc test the hypothesis that 
the observed large-scale power spectrum shape matches 
that of a CDM linear matter power spectrum over the 
scales where the matter is predicted to be approximately 
in the linear regime. We do this by assuming that the ob- 
served galaxy power spectrum does match a linear CDM 
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Fig. 16. — The power spectra recovered from the combined SDSS 
DR5 sample assuming Qm = 0.24 to convert from redshifts to 
comoving distances (solid circles with l-cr errors) compared with 
the best fit CDM models. The dashed line shows the best-fit model 
over 0.01 < k < 0.06/iMpc~^, while the solid line shows the best- 
fit model over 0.01 < fc < 0.15/iMpc-i. The two fits are discrepant 
at approximately 2-3 a. 

model, and then consider if this results in a contradic- 
tion in the cosmological parameters derived fitting to dif- 
ferent scales. This hypothesis effectively assumes that 
the contributions to the power spectrum from non-linear 
structure growth and redshift-space distortions cancel, 
while galaxy bias is scale-independent. Any contradic- 
tion would show (from a single data set), that this can- 
not be the case. Second, the parameter values Q ~ 26 
for the SDSS LRGs and Q ~ 4 for the 2dFGRS galaxies 
in the Q-model suggest that we should expect a strong 
change in the shape of the galaxy power spectrum as a 
function of galaxy properties. We split the large SDSS 
DR5 sample into subsamples to look for such effects on 
large scales 0.01 < fc < 0.15/iMpc~^ in Sections 7 & 8. 

6. Qm from the power SPECTRUM SHAPE 

In this section we fit CDM models to the SDSS DR5 
power spectrum data assuming that the observed power 
spectrum matches the shape of a linear CDM model. 
We have calculated a grid of model power spectra as 
described in Section 5, allowing the matter density Om 
and baryon fraction ilb/flM to vary, assuming a scalar 
spectral index of Ug ~ 0.96, and a Hubble parameter 
h = 0.73. Varying Us causes a small change in the re- 
covered value of the matter density: following Cole et al. 
(2005), the effect can be approximated by {0,Mh)tTue = 
(r^M ^)apparont + 0.3(1 — Us). Similarly, the effect of a 
non-zero neutrino fraction would change the recovered 
value by approximately {flMh)tTue = (^^M/i)apparent + 
1.2{Q^/flM)- For each value of the matter density tested, 
we have recalculated the window function from the ge- 
ometry of the sample using the appropriate comoving 
distance-redshift relation. This window function is used 
to convolve the model power, and is used to correct 
for the loss of power due to the normalisation of the 
overdensity field: because the total expected number of 
galaxies is unknown, the normalisation of the random 
catalogue was matched to the galaxy catalogue so that 
/ F{r) dr = (see Eq. 3, and the subsequent discus- 
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sion). The effect of this on the data is to subtract a 
muhiplc of the window function, so that Pdata(O) = 0. 
We therefore subtract a multiple of the window function 
from the model power spectra (after convolution with 
the appropriate window) so that Pmodei(O) = 0. This 
is the equivalent of the "integral constraint" correction 
required for measured correlation functions. The like- 
lihoods of the model power spectra are then calculated 
assuming that the data form a multi-variate Gaussian 
distribution with the appropriate covariance matrix, cal- 
culated as described in Section 4. 

The resulting likelihood surfaces arc plotted in Fig. 14. 
Contours are plotted for fits to two different fc-ranges, 
0.01 < fc < 0.06/iMpc~^ (dashed contours), and 0.01 < 
k < 0.15 /iMpc""'^ (solid contours). As can be seen, the 
choice of scales fitted makes a strong difference to the 
recovered best fit parameters. On scales 0.01 < k < 
0.06/iMpc~^, the shape matches that of a low matter 
density cosmology with Qm = 0.22 ± 0.04. Extending 
the fit to smaller scales, the data prefer a higher mat- 
ter density ^Im — 0.32 ± 0.01. The fits are discrepant 
at the 2 — 3cr level. The corresponding marginalised 
baryon fractions are rifc/r^M = 0.13±0.05 and flt/i^M = 
0.16 ±0.03. On scales 0.01 < fc < 0.06ft.Mpc"\ the 
small scale damping does not strongly affect the recov- 
ered power and we recover the same parameter con- 
straints if we do not make the small-scale damping cor- 
rection to the model baryon acoustic oscillations. Fit- 
ting to 0.01 < k < 0.15/iMpc~^ without the damping 
term reduces the baryon fraction constraint slightly to 
Qb/flM = 0.15 ±0.02. 

The results from fitting the power spectrum shape are 
only weakly dependent on the model assumed to cal- 
culate comoving distances from galaxy redshifts. We 
have considered fitting general CDM models to the single 
power spectrum calculated assuming a fixed cosmological 
distance model with Om = 0.24 (as plotted in Fig. 12), 
and likelihood contours are presented in Fig. 15, as for 
Fig. 14. The general form of the contours remains con- 
sistent, although the locations of the likelihood maxima 
do change - for the fit to 0.01 < A; < 0.15 /iMpc"^ we 
recover Qm = 0.31 ± 0.02. 

The best fit models are compared with the data 
in Fig. 16, which clearly shows why the discrepancy 
arises. There; is too much pow(;r on scales 0.01 < k < 
0.06 /iMpc~^ for the ^Im = 0.32 model at approximately 
1 a. Conversely, for the fit to the large-scale data, there 
is too much power on smaller scales k > 0.06 /iMpc~^. 

7. POWER SPECTRA OF SUB-CATALOGUES AND 
ROBUSTNESS TESTS 

In this section we present a number of test power spec- 
tra, calculated for variations of our method or for differ- 
ent subsamples of the data. The power spectra were cal- 
culated using a flat cosmological model with Qm = 0.24 
to convert from redshifts to comoving distance, and are 
compared in Figs. 17 & 18. In the left column they are 
divided by a linear power spectrum calculated with no 
baryons and with Qm = 0.18, which has approximately 
the correct shape but no baryonic features and, in the 
right column, to the same model convolved with the ap- 
propriate window function for each power spectrum, and 
with matched normalisation. For comparison we plot the 



power spectrum recovered using our standard analysis 
method in the top row of Fig. 17. 

1. We have split the catalogue into (slightly overlap- 
ping) main galaxy and LRG subsamples, and have 
calculated power spectra for these samples using 
our standard method and our standard luminosity- 
dependent bias model. Fig. 17 shows that the large- 
scale normalisation of the recovered powca' spectra 
match well compared with the relative average bias 
of the two populations. This shows that the bias 
model used is renormalising both power spectra to 
match that of an galaxy. There is weak evidence 
for a small normalisation offset, matching the small 
offset in the measured LRG bias compared with 
the model assumed (the open circles are slightly 
above the dashed line in Fig. 6). We will test the 
bias model later, and show that small deviations 
in the bias assumed for the LRGs compared with 
the main galaxies do not affect the power spectrum 
calculated from the combined sample (test 5). On 
small scales the LRG power has a higher amplitude 
compared with the main galaxy power. 

2. We test our angular mask by comparing recovered 
power spectra from samples of main galaxies se- 
lected to a limiting magnitude of r < 17.5 using 
the angular and radial selection methods described 
in Section 2, with a similar catalogue obtained from 
the SDSS New York University Value Added Cat- 
alogue (NYU-VAGC; Blanton et al. 2004). When 
analysing the NYU-VAGC sample we used their 
angular mask, which therefore matches the mask 
derivation previously used for SDSS team analy- 
ses (Tegmark et al. 2004; Eisenstein et al. 2005; 
Tegmark et al. 2006), although this calculation has 
obviously been updated to cover the SDSS DR5 re- 
gion. There are small changes between the NYU- 
VAGC mask and our mask, due to slightly different 
selection criteria and regions covered, but in the ar- 
eas of overlap they match well. We select a "safe" 
sample of main galaxies with a magnitude limit of 
r < 17.5 in order to provide a sample with a single 
radial distribution, and avoid complications from 
differing faint survey limits. For the NYU-VAGC 
catalogue we use the model provided for the ra- 
dial distribution of galaxies, which was calculated 
from fits to the luminosity function as described in 
Tegmark et al. (2004). For our sample we calculate 
the radial distribution of galaxies as described in 
Section 2.2. Even though these independent meth- 
ods vary substantially in design for both the angu- 
lar and radial selection, as can be seen in Fig. 17, 
there are only very minor differences between the 
recovered power spectra. 

3. We test the radial distribution assumed for the 

galaxy population by comparing the power spec- 
tra measured in 5 redshift slices of width Az = 
0.1 through the combinc;d sanipk^ of main galax- 
ies and LRGs. The standard luminosity-dependent 
bias model is assumed, and appears to adequately 
renormalise the power spectra calculated from the 
different samples there is no evidence for a sig- 
nificant change in normalisation of the power as 
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Fig. 17. — Power spectra calculated from the SDSS DR5 sample divided by a model linear power spectrum with h = 0.73, f2j\/ = 0.18, 
Us = 0.96 and no baryons (in order not to introduce extra oscillations in the plotted power spectra). In the left column, we simply divide 
by the raw model power spectrum, with a fixed normalisation matched to the final power spectrum from the combined LRG + Main galaxy 
sample. In the right hand column we convolve the model power spectrum with the window function appropriate to the test being performed 
and correct for the loss of power due to the normalisation of the overdensity field, before calculating the ratio of the power spectra. The 
amplitude is allowed to vary to match the data on scales 0.01 < k < 0.06/iMpc~^, so we are only comparing the shapes of the power 
spectra recovered from the different tests in the right column. The top row shows the ratios calculated for our final power spectrum derived 
from the combined main galaxy and LRG catalogue. The other rows show power spectra calculated either for different subsamples of this 
catalogue, or using slightly different techniques, and are described in Section 7. 
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a function of redshift. However, the magnitude 
limited catalogues from which the bias model was 
derived have redshift increasing with luminosity. 
The derivation of the model is therefore coupled 
with any redshift evolution. Consequently, it is 
perhaps not surprising that the amplitudes of the 
power spectra recovered in different redshift slices 
are so similar, although it is still gratifying to see 
that this is correct. A simple model with constant 
galaxy clustering as a function of redshift and with 
redshift-independent luminosity-dependent bias is 
therefore sufficient to model the currently observed 
clustering, but is not necessarily a unique solution. 

4. The main galaxy sample is bimodal in colour; 
wo have decomposed this dataset into red and 
blue galaxy subsamples using a simple colour cut 
Mo.ig — Mo.i^ = 0.8 (see Fig. 3). These subsam- 
ples were analysed in exactly the same way as our 
final combined sample. In particular, the redshift 
distribution fitting function of Eq. 1 was found to 
still provide an adequate match to the observed 
redshift distribution. The power spectra for these 
subsamples were corrected using our bias model, 
so the change in normalisation of the power spec- 
tra of the red and blue subsamples demonstrates 
the additional colour dependent bias term that is 
not included in our average bias model. As can be 
seen, there is no obvious change in the shape of the 
power spectra as a function of colour. 

5. We also consider a power spectrum calculated us- 
ing the bias model of Norberg et al. (2001) rather 
than the bias model of Tegmark et al. (2004). The 
primary effect is a change of normalisation, equiv- 
alent to a change in the definition of h■^, . Once this 
is corrected in the right hand panel of Fig. 18, we 
see no significant change in the recovered power 
spectra. We also consider an "offset" bias model, 
where we assume a model bias for the LRGs given 
by b/K = 0.85 + 0.15i/L, -I- 0.08(M* - Mo.i^), but 
do not change the bias model for the main galaxies. 
If this model is used to calculate the power in the 
LRG and main galaxy catalogues separately, the 
recovered large-scale power spectrum amplitude re- 
covered from the LRGs is reduced, and the LRG 
and Main galaxy power spectra are in better agree- 
ment (see test 1). For the combined sample, (or 
the LRG or main galaxy samples if analysed inde- 
pendently), this change does not affect the overall 
shape of the power spectrum, giving us confidence 
that any error introduced by joining the two galaxy 
catalogues is not significant. 

6. The angular coverage of the SDSS sample is now 
sufficiently large that we can split the catalogue 
as a function of Galactic extinction. The median 
r-band extinction in the sample is 0.065, and we 
split into galaxies with a higher extinction correc- 
tion, and galaxies with a lower extinction correc- 
tion. The recovered power spectra are not inde- 
pendent, and we have not calculated relative errors 
for these data. However, the power spectra diverge 
on scales k < 0.02 /iMpc"^, and are also slightly 
discrepant on scales k = 0.05 h Mpc~^, which could 



indicate a systematic problem with the extinction 
corrections. A recent test of the number density 
of SDSS galaxies as a function of the Schlegel et 
al. (1998) Galactic extinction correction was per- 
formed by Yahata et al. (2006). They find that the 
number density of galaxies increases with increas- 
ing extinction for SFD extinction values below 0.1 
magnitudes in the r-band. The contamination of 
the far infrared brightness of the sky by background 
galaxies postulated as an explanation of this effect 
in this work, might also explain the observed dif- 
ference between these power spectra. 

7. The final tests presented in Fig. 18 compare power 
spectra recovered for galaxies of different luminos- 
ity. For the main galaxies, we plot 4 of the 9 
pseudo- volume limited catalogues described in Sec- 
tion 3. Each catalogue is 0.5 magnitudes wide (not 
K-corrected) , and no bias model has been included. 
Consequently, the increase in the normalisation of 
the large-scale clustering as a function of luminos- 
ity is clear in the left column. 

8. As for (7), but now comparing 4 LRG sub- 
catalogues, each of width 0.5 magnitudes (not K- 
corrected). Looking at the left column, the increase 
in the overall clustering strength with luminosity is 
clear. In the right column, we see that any change 
in the shape of the power spectrum as a function 
of luminosity is at a level significantly below the 
change in the large-scale normalisation. The possi- 
bility that the power spectrum does change shape 
on scales 0.01 < k < 0.15/iMpc~^ as a function 
of luminosity will be considered further in the next 
Section. 



8. OBSERVED CLUSTERING AS A FUNCTION OF 

LUMINOSITY 

In this section, we expand on tests (7) and (8) pre- 
sented in Section 7 in order to consider whether there 
is any evidence that the power spectrum changes shape 
as a function of luminosity on scales 0.01 < k < 
0.15 /iMpc~^. In order to test this, we have created three 
pseudo- volume limited subcatalogues of the main galaxy 
sample, using the method described in Section 3.1, but 
now each of width 1-magnitude, covering —22.5 < M^. < 
— 19.5 (without K-correction). We have also split the 
LRG sample into two catalogues at approximately the 
median magnitude Mo-i^ = —22.3. Power spectra have 
been calculated for these five subcatalogues, and are plot- 
ted in Fig. 19, divided by a fiducial power spectrum 
model matched to each power spectrum as in the right 
column of Figs. 17 & 18. 

With the amplitude of the data matched on large 
scales, on small scales k > 0.2/iMpc~^, there is a clear 
hierarchy with more luminous galaxies showing stronger 
small-scale clustering. In order to quantify the effect on 
cosmological parameter estimation in the regime where 
the matter is still linear (fc < 0.15 /i Mpc~^), the im- 
portant question is "on what scale do the power spec- 
tra start to deviate from each other?". Unfortunately, 
the data are insufficient to answer this question in a 
quantitative way. For the samples split from the main 
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Fig. 18. — Continuation of Fig. 17. To accommodate the range of power spectrum normalisations in tests (7) and (8), the scale of the 
y-axis was changed in these panels. 
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Fig. 19. — Power spectra calculated from LRG and main galaxy 
subcatalogues divided by a smooth CDM model matched to the 
subcatalogue (as in the right panels in Fig. 17). The amplitude of 
the CDM model was matched to the data over the narrow fc-range 
0.03 < < 0.05/iMpc~^ For the main galaxies, 3 pseudo-volume 
limited catalogues of width 1-magnitude covering —22.5 < < 
— 19.5 were created and analysed (as described in Section 3.1). For 
the LRGs, the catalogue was split at approximately the median 
luminosity of the sample Mo.i^ = —22.3. 

galaxy catalogue, there is perhaps a slight trend with 
the least luminous galaxies having less power on scales 
0.06 < k < 0.15 ft. Mpc^^, but this conclusion is fairly 
speculative. 

9. DISCUSSION 

We have analysed the clustering of galaxies in the 
largest sample (in terms of number and volume covered) 
to date. We have shown that the SDSS main galaxy and 
LRG samples can be naturally combined, because both 
sets of galaxies, after careful luminosity selection, can be 
described using the same average large-scale luminosity- 
dependent bias model. Analysing the combination of 
data allows us to include cross-correlation between the 
two data sets as well as internal auto-correlations within 
either the LRG or main galaxy samples. 

Because of the speed and simplicity of the Fourier 
method used, we have been able to consider a number 
of tests of the recovered power spectrum, and have elim- 
inated a number of possible systematics from our analy- 
sis. The only unusual result was a possible problem with 
the Galactic extinctions; this requires further detailed 
analysis and modelling that is outside the scope of our 
paper. In addition to considering a larger data set than 
previously analysed, our analysis extends previous work 
by considering the cosmological model to be tested from 
the start of the analysis - using this model to convert 
from redshift to comoving distance, as well as comparing 
with the resulting power spectrum. 

In a companion paper we have considered the cos- 
mological constraints from the baryon acoustic oscilla- 
tions observed in the power spectrum, where we found 
^Ai = 0-2501qq2i for flat A cosmological models (Perci- 



val et al. 2006) when combined with the observed baryon 
acoustic oscillation scale in the CMB (Spergel et al. 
2006). This paper has instead focused on trying to model 
the overall shape of the power spectrum, paying particu- 
lar attention to systematic effects on the resulting mat- 
ter density constraints. Because of the accuracy with 
which we can now measure the power spectrum on scales 
k < 0.05 ft Mpc~^ as a consequence of the large volume 
covered by the LRG sample, we can test the hypoth- 
esis that the shape of the redshift-space galaxy power 
spectrum matches the matter on scales where the mat- 
ter clustering is well modelled by linear evolution from 
the seed perturbations (0.01 < k < 0.15ftMpc"^). We 
find a discrepancy between fits to fc < 0.06 ftMpc^^, and 
0.06 < k < 0.15ft.Mpc^^, albeit at only moderate sig- 
nificance. The data on scales 0.01 < k < 0.06ftMpc~^ 
favour a model with flM = 0.22 ± 0.04, while extending 
the fc-range to 0.01 < k < 0.15ftMpc~^ increases the 
estimated value to — 0.32 ± 0.01. Statistically, this 
discrepancy is only at the 2-3(T level. It is worth not- 
ing that the excess power on scales k < 0.06ftMpc~^ 
relative to the = 0.32 model matches that recently 
found from analyses of the SDSS photometric-redshift 
sample (Padmanabhan et al. 2006; Blake et al. 2006). 
This suggests that the discrepancy is therefore not lim- 
ited to our analysis. The upper k limit for the large-scale 
fit, Mim = 0.06 ft Mpc~^, was conservatively chosen based 
on the observed shape of the power spectrum (Fig. 12). 
The noise in the data means that there is nothing special 
in this choice, and we could have split the scales fitted at 
0.05 < fciini < O.lftMpc"^, and obtained similar results. 

Interestingly, the change in the recovered matter den- 
sity with the scales fitted matches the bimodality of pre- 
vious matter density constraints, with the best-fit model 
on scales 0.01 < k < 0.06ftMpc^^ matching that pre- 
dicted by the WMAP 3-year data (Spergel et al. 2006), 
the 2dFGRS (Cole et al. 2005), and the results from the 
baryon acoustic oscillations observed in the SDSS DR5 
sample. In contrast, the fits to 0.01 < k < 0.15ftMpc~^ 
match the results from early studies of the SDSS data by 
Pope et al. (2004) and Tegmark et al. (2004), suggest- 
ing that we are recovering approximately the same over- 
all shape on these scales as these early analyses. The 
new data obviously constrain f^A/ft with greater accu- 
racy so, on the scales previously considered, the signif- 
icance of the offset between 2dFGRS and SDSS anal- 
yses has increased. Additionally, the higher value of 
flM ~ 0.32±0.01 is now more discrepant with the (mean) 
WMAP constraint Qm = 0.234 ± 0.034 (from Table 2 
of Spergel et al. 2006), and is discrepant with the lower 
matter density recovered from the positions of the baryon 
acoustic oscillations in the power spectrum (Percival et 
al. 2006). 

The hypothesis being tested by these model fits is that 
the contributions to the power spectrum from non-linear 
structure growth and redshift-space distortions cancel, 
galaxy bias is scale-independent, and that the matter 
clusters as expected in a simple CDM model with an 
unbroken power spectrum. The results provide a contra- 
diction to this hypothesis with a significance of 2-3cr. We 
will now consider each of the elements in turn to try to 
understand which assumption is breaking down. First, 
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it is worth noting that the effect of non-hncar structure 
growth was discussed in Section 5 and shown to be too 
small to cause the observed power spectrum distortion. 

Small-scale redshift-space distortions act in the wrong 
direction for the observed effect, and it would require a 
significant break-down of the scale-independent increase 
in power spectrum amplitude predicted by Kaiser (1987) 
for linear infall observed at large distances to give a scale- 
independent increase in the power spectrum amplitude 
and cause the observed change in shape. N-body sim- 
ulations, and the halo model calculations performed for 
the 2dFGRS also found that the effect of redshift space 
distortions on the shape of the power spectrum is small 
on the scales of interest (Percival et al. 2001; Cole et 
al. 2005). The consistency between the higher matter 
density favoured fitting to 0.01 < k < 0.15 /iMpc^^ in 
our study, previous studies of the real-space power spec- 
trum (Tegmark et al. 2004), and the power spectrum 
calculated from sets of photometric-redshift LRGs also 
provide evidence against redshift-space distortions pro- 
ducing the observed effects. 

An unwelcome possibility is a systematic problem with 
one of the data sets, or the analysis method, although the 
only discrepancy revealed in the tests performed in Sec- 
tion 7 was between the power spectra recovered using 
different Galactic extinction correction cuts. However, 
this potential systematic could also explain differences 
between 2dFGRS and SDSS power spectra if it predom- 
inantly affects the SDSS galaxies; analysing the galaxy 
clustering in the low extinction regions in the SDSS, for 
which Yahata et al. (2006) show that the number den- 
sity of galaxies does not behave as expected, produces a 
slight excess of power on scales k ~ 0.06/iMpc~^ (see 
Fig. 18). Additionally, it is perhaps also worth men- 
tioning the hcTctical possibility that there is a problem 
with the assumption that the matter clusters as expected 
in a CDM model with an unbroken post-inflation power 
law spectrum. However, such a problem would affect the 
2dFGRS and SDSS galaxies in the same way, so a further 
explanation would be required for this difference. 

Perhaps the most simple explanation for this inconsis- 
tency is that the luminous red galaxies that dominate our 
combined sample do not trace the linear matter power 
spectrum as simply as other galaxies, and there is some 
theoretical work that supports this assertion. In order to 
try to obtain evidence for scale-dependent galaxy bias, 
we have analysed the shape of the SDSS power spectrum, 
particularly looking for evidence of an increase in small- 
scale clustering power that depends on galaxy proper- 
ties. The luminosity-bias correction applied as part of the 
Fourier method will only correct for the large-scale, scale- 
independent bias affecting different luminosity galaxies. 
It is not designed to correct for scale-dependent bias. 
We have analysed a number of subsets of our final cata- 
logue in order to look for scale-dependent bias, and find 
no significant change in shape if we split our sample us- 
ing the bimodal galaxy colour distribution, or if we split 
in redshift; we recover similar power spectrum shapes, 
analysing the data in 5 redshift slices out to 2; = 0.5. 
There is weak evidence for a change in the shape of the 
power spectrum for 0.01 < k < 0.15/iMpc~^ when split- 
ting the galaxies by luminosity, which varies with the 
average r-band luminosity of the galaxy sample anal- 



ysed. However, the evidence for this is not conclusive. 
It is worth noting that we clearly see the effects of colour 
and luminosity on smaller scales k > 0.2 /iMpc~^, in line 
with the results of Cole et al. (2005). If scale-dependent 
bias also affects large scales then, in principle, this is 
testable by measurement of the bispectriim on the same 
data set (Scoccimarro et al. 2006): the bispectrum shape 
and scale dependence respond to bias in a way that dif- 
fers from, and is therefore not degenerate with, the power 
spectrum. 

A change in power spectrum shape on scales k < 
0.15 /i Mpc^^, as a fimction of luminosity would pro- 
vide a consistent picture when we compare our recov- 
ered power spectrum with previous work. On scales 
k < 0.06/?, Mpc""', Fig. 13 shows that the shape is consis- 
tent with the 2dFGRS power spectrum (Cole et al. 2005). 
If the amplitudes of the power spectra are matched on 
these large scales, then on smaller scales there is a pro- 
gression from the low clustering amplitude of the 2dF- 
GRS galaxies, through the main galaxies of the DR3 
SDSS sample analysed by Tegmark et al. (2004) to the 
higher clustering strength of the combined main galaxy 
and LRG sample of this analysis. There is no reason to 
suggest that this is not a natural progression following 
the trend observed within the SDSS from galaxies with a 
low r-band luminosity to those with a high r-band lumi- 
nosity. In this interpretation, the "excess power" on the 
largest scales probed by the Padmanabhan et al. (2006) 
and Blake et al. (2006) analyses of photometric LRGs is a 
consequence of comparing to an incorrect reference power 
spectrum with high f^M- The true value of is lower, 
and the excess power is on small scales, as in Fig. 12 here. 
The observed clustering was calculated using a weighted 
average over all galaxy pairs. Consequently, if we analyse 
the DR5 LRGs and main galaxies separately and match 
the large-scale clustering amplitudes, we would expect 
a progression from a lower small-scale amplitiide of the 
main galaxies, through the combined sample, to a higher 
clustering amplitude of the LRGs. Fig. 19 shows that 
this is indeed what we find. 

The combination of scale-dependent bias, redshift- 
space distortions and non-linear structure growth for a 
set of galaxies has previously been matched by a simple 
fitting formulae (the Q-model) applied after calculation 
of the power spectrum (Cole et al. 2005). However, it 
remains to be seen if this simple prescription can ade- 
quately describe galaxy bias for LRGs or for a mixed 
sample of galaxies with different clustering properties; 
these issues will be considered in Tegmark et al. (2006). 
The analysis presented in our paper suggests that this 
correction could become increasingly important for sam- 
ples as the average r-band galaxy luminosity increases. 
It is clear that, for LRGs, the relation between galaxies 
and dark matter will need to be carefully modelled in 
order to extract the maximum possible information from 
the shape of the observed power spectrum. 

10. SUMMARY 

In summary, using the SDSS, the redshift-space galaxy 
power spectrum is now known with sufficient accuracy 
on large scales to test the link between galaxies and 
the underlying matter distribution within the class of 
CDM models. If we assume a scale-independent bias be- 
tween galaxies and the mass, we find that no linear CDM 
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model can fit the data over 0.01 < k < 0.15 /iMpc , 
with fits to 0.01 < k < 0.06/iMpc~^ suggesting a mat- 
ter density that is 2 — 3(t from that derived fitting to 
0.01 < k < 0.15 /iMpc~^. Perhaps the simplest ex- 
planation is that the large-scale distribution of lumi- 
nous red galaxies is affected by scale-dependent bias, 
although we cannot rule out the alternative possibility 
that this is due to a systematic effect. If we calcu- 
late power spectra for subsets of galaxies selected us- 
ing the r-band luminosity, then we only see weak evi- 
dence for a change in shape of the power spectrum on 
the appropriate scales, with galaxies that are less lumi- 
nous having a scale-dependent bias that is weaker and 
affects smaller scales than more luminous galaxies. Such 
a bias model could also explain the bimodality in matter 
densities calculated from other data sets and from pre- 
vious SDSS power spectra. The blue-selected 2dFGRS 
galaxies would be less affected by this scale-dependent 
bias than the red-selected galaxies and bias modelling 
would be less important when calculating cosmological 
constraints. Hence the simple assumption of Percival 
et al. (2001) that the 2dFGRS galaxy power spectrum 
has the same shape for k < 0.15/iMpc~^ as the matter 
power spectrum is comparatively harmless. As the r- 
band luminosity of the sample increases, the non-linear 
relation between the galaxies and dark matter needs to 
be carefully modelled when providing cosmological con- 
straints from galaxy clustering even on relatively large 
scales k < 0.15/iMpc~^ 
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